Development of an Open Source Natural Language Generation Tool for Finnish
نویسنده
چکیده
We present an open source Python library to automatically produce syntactically correct Finnish sentences when only lemmas and their relations are provided. The tool resolves automatically morphosyntax in the sentence such as agreement and government rules and uses Omorfi to produce the correct morphological forms. In this paper, we discuss how case government can be learned automatically from a corpus and incorporated as a part of the natural language generation tool. We also present how agreement rules are modelled in the system and discuss the use cases of the tool such as its initial use as part of a computational creativity system, called Poem Machine.
منابع مشابه
PapyrusRT: Modelling and Code Generation (Invited Presentation)
In this talk we introduce PapyrusRT, an open-source, industrial-strength model-driven development environment for real-time and embedded systems, implementing UML-RT [2,3], a UML-based language. PapyrusRT is implemented on top of Papyrus, an Eclipse modelling tool for UML, SysML, and EMF models. We describe the motivations for this project and in particular for the need of an open-source enviro...
متن کاملDo wordnets also improve human performance on NLP tasks?
FinnWordNet is a wordnet for Finnish that complies with the format of the Princeton WordNet (PWN) (Fellbaum, 1998). It was built by translating the Princeton WordNet 3.0 synsets into Finnish by human translators. It is open source and contains 117000 synsets. The Finnish translations were inserted into the PWN structure resulting in a bilingual lexical database. In natural language processing (...
متن کاملThe Grammar Matrix: An Open-Source Starter-Kit For The Rapid Development Of Cross-Linguistically Consistent Broad-Coverage Precision Grammars
The grammar matrix is an open-source starter-kit for the development of broadcoverage HPSGs. By using a type hierarchy to represent cross-linguistic generalizations and providing compatibility with other open-source tools for grammar engineering, evaluation, parsing and generation, it facilitates not only quick start-up but also rapid growth towards the wide coverage necessary for robust natura...
متن کاملModularisation of Finnish Finite-State Language Description - Towards Wide Collaboration in Open Source Development of a Morphological Analyser
In this paper we present an open source implementation for Finnish morphological parser. We shortly evaluate it against contemporary criticism towards monolithic and unmaintainable finite-state language description. We use it to demonstrate way of writing finite-state language description that is used for varying set of projects, that typically need morphological analyser, such as POS tagging, ...
متن کاملGenerating Multilingual Personalized Descriptions from OWL Ontologies on the Semantic Web: the NaturalOWL System∗
We present Naturalowl, an open-source multilingual natural language generation system that produces personalized descriptions of individuals and classes, starting from a linguistically annotated ontology. The generator is heavily based on ideas from ilex and m-piro, but it is in many ways simpler and it provides native support for owl dl ontologies, which we annotate with linguistic and user mo...
متن کامل